K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 61 | 84 | 95 | 97 | 97 |
1000 | 257 | 608 | 818 | 926 | 960 |
10000 | 655 | 2323 | 4522 | 6611 | 8022 |
100000 | 1850 | 9218 | 23553 | 41086 | 57948 |
1000000 | 8026 | 40986 | 132958 | 276054 | 424775 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings